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Abstract 

A trie is a search tree scheme that employs the structure of search keys to organize in- 
formation. Tries were originally devised as a means to represent a collection of records 
indexed by strings over a fixed alphabet. Based on work by CP. Wadsworth and others, 
R.H. Connelly and F.L. Morris generalized the concept to permit indexing by elements 
of an arbitrary monomorphic datatype. Here we go one step further and define tries and 
operations on tries generically for arbitrary first-order polymorphic datatypes. The deriva- 
tion is based on techniques recently developed in the context of polytypic programming. It 
is well known that for the implementation of generalized tries nested datatypes and poly- 
morphic recursion are needed. Implementing tries for polymorphic datatypes places even 
greater demands on the type system: it requires rank-2 type signatures and higher-order 
polymorphic nested datatypes. Despite these requirements the definition of generalized 
tries for polymorphic datatypes is surprisingly simple which is mostly due to the frame- 
work of polytypic programming. 



All generalizations are dangerous, even this one. 

— Alexandre Dumas 



1 Introduction 

The concept of a trie was introduced by A. Thue in 1912 as a means to represent a 
set of strings, see (Knuth, 1998). In its simplest form a trie is a multiway branching 
tree where each edge is labelled with a character. For exam- 
ple, the set of strings {ear, earl, east, easy, eye} is represented 
by the trie depicted on the right. Searching in a trie starts at 
the root and proceeds by traversing the edge that matches the 
first character, then traversing the edge that matches the second 
character, and so forth. The search key is a member of the repre- 
sented set if the search stops in a node which is marked — marked 
nodes are drawn as filled circles on the right. Tries can also be 
used to represent finite maps. In this case marked nodes addi- 
tionally contain values associated with the strings. Interestingly, 
the move from sets to finite maps is not a mere variation of the theme. As we shall 
see it is essential for the further development. 
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At a more abstract level a trie can be seen as a composition of finite maps. Each 
collection of edges, descending from the same node, constitutes a finite map sending 
a character to a trie. With this interpretation in mind it is relatively straightforward 
to devise an implementation of string-indexed tries. For concreteness, programs will 
be given in the functional programming language Haskell 98 (Peyton Jones et al., 
1999). If strings are defined by the following datatype 

data Str = Nil | Cons Char Str , 
we can represent string- indexed tries as follows. 

data MapStr v = TrieStr (Maybe v) (MapChar (MapStr v)) 

The first component of the constructor TrieStr contains the value associated with 
Nil. Its type is Maybe v instead of v, since Nil may not be in the domain of the 
finite map. In this case the first component equals Nothing. The second component 
corresponds to the edge map. To keep the example manageable we assume that a 
suitable data structure, MapChar, and an associated look-up function lookupChar 
are predefined. Now, to lookup a non-empty string, say, Cons c s we lookup c in 
the edge map obtaining a trie which is then recursively searched for s. 

lookupStr :: Str — > MapStr v — > v 

lookupStr Nil (TrieStr Nothing tc) = error "not found" 

lookupStr Nil (TrieStr (Just v) tc) = v 

lookupStr (Cons c s) (TrieStr tn tc) = (lookupStr s o lookupChar c) tc 

Based on work by CP. Wadsworth and others, R.H. Connelly and F.L. Morris 
(1995) have generalized the concept of a trie to permit indexing by elements of 
an arbitrary monomorphic datatype. The definition of lookupStr already gives a 
clue how a suitable generalization might look like: the trie TrieStr tn tc contains a 
finite map for each constructor of the datatype Str; to lookup Cons c s the look-up 
functions for the components, c and s, are simply composed. The type constructor 
Maybe can be seen as implementing finite maps over the unit datatype. Generally, if 
we have a datatype with k constructors, the corresponding trie has k components. 
To lookup a constructor with n components, we must select the corresponding 
finite map and compose n look-up functions of the appropriate types. To illustrate, 
consider the datatype of external search trees. 

data Bin = Leaf Str | Node Bin Char Bin 
The trie for external search trees is given by 

data MapBin v = TrieBin (MapStr v) 

(MapBin (MapChar (MapBin v))) . 

The type MapBin is an instance of a so-called nested datatype (nest for short). The 
term 'nested datatype' has been coined by R. Bird and L. Meertens (1998) and 
characterizes polymorphic datatypes whose definition involves 'recursive calls' — 
MapBin (MapChar (MapBin v)) in the example above — which are substitution 
instances of the defined type. Functions operating on nested datatypes are known 
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to require a non-schematic form of recursion, called polymorphic recursion (Mycroft, 
1984). The look-up function on external search trees may serve as an example. 

lookupBin :: Bin — > MapBin v — > v 

lookupBin {Leaf s) (TrieBin tl tn) = lookupStr s tl 
lookupBin (Node I c r) ( TrieBin tl tn) 

= (lookupBin I o lookupChar c o lookupBin r) tn 

Looking up a node involves two recursive calls. The second, lookupBin r, is of type 
Bin — > MapBin (MapChar (MapBin v)) — > MapChar (MapBin v) which is a sub- 
stitution instance of the declared type. Haskell allows polymorphic recursion only 
if an explicit type signature is provided for the function (s). The rationale behind 
this restriction is that type inference in the presence of polymorphic recursion is 
undecidable (Henglein, 1993). 

Note that it is absolutely necessary that MapBin and lookupBin are paramet- 
ric with respect to the codomain of the finite maps. If we restricted the type of 
lookupBin to Bin — > MapBin s — > s for some fixed type s, the definition would no 
longer type-check. This also explains why the construction does not work for the 
finite set abstraction. 

From the discussion above it should be clear how to define tries for arbitrary 
monomorphic datatypes. In this paper we go one step further and show how to gen- 
eralize the concept to arbitrary first-order polymorphic datatypes. We will answer 
in particular the intriguing question what the generalized trie of a nested datatype 
looks like. Note that this question is not only of theoretical but also of practical 
interest. A number of data structures, such as 2-3 trees or red-black trees, have re- 
cently been shown to be expressible by nested declarations. R. Bird and R. Paterson 
(1998) use a nested datatype for expressing de Bruijn notation. Now, if a look-up 
structure for de Bruijn terms is required, say, to implement common subexpression 
elimination, we are confronted with the problem of constructing generalized tries 
for a nested datatype. 

To develop generalized tries for polymorphic datatypes we will employ the frame- 
work of polytypic programming. In short, a generic or polytypic function is one 
which is defined by induction on the structure of types. A simple example for a 
polytypic function is flatten :: f a — > [a] which traverses an element of / a and 
collects all elements of type a from left to right in a list. The function flatten can 
sensibly be defined for each polymorphic type and it is usually a tiresome, routine 
matter to do so. A polytypic programming language enables the user to program 
flatten once and for all times. The specialization of flatten to concrete instances 
of / is then handled automatically by the system. Polytypic programming can be 
surprisingly simple. In a companion paper (Hinze, 1999) we show that it suffices 
to define a polytypic function on predefined types, type variables, sums, and prod- 
ucts. This information is sufficient to specialize a polytypic function to arbitrary 
datatypes including mutually recursive and nested datatypes. 

Generalized tries make a particularly interesting application of polytypic pro- 
gramming. The central insight is that a trie can be considered as a type-indexed 
datatype. This makes it possible to define tries and operations on tries generically 
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for arbitrary polymorphic datatypes. We already have the necessary prerequisites 
at hand: we know how to define tries for sums and for products. A trie for a sum is 
a product of tries and a trie for a product is a composition of tries. The extension 
to arbitrary datatypes is then uniquely defined. 

We have seen that nested datatypes and polymorphic recursion are necessary 
for the implementation of generalized tries. Implementing tries for polymorphic 
datatypes, especially nested datatypes, places even greater demands on the type 
system: it requires rank-2 type signatures (McCracken, 1984), higher-order poly- 
morphic datatypes (Jones, 1995), and higher-order polymorphic nests. Fortunately, 
all major Haskell system provide the necessary extensions. 

The rest of this paper is structured as follows. In Section 2 we briefly review 
the theoretical background of polytypic programming. A more detailed account is 
given in (Hinze, 1999). Section 3 applies the technique to implement a finite map 
abstraction based on generalized tries. Section 4 discusses variations of the theme. 
Finally, Section 5 reviews related work and points out a direction for future work. 



A polytypic function is one which is parameterised by a datatype. The polytypic 
programming primer therefore starts with a brief investigation of the structure 
of types. The following definitions will serve as running examples throughout the 
paper. 



The meaning of these datatypes in a nutshell: the first equation defines the ubiq- 
uitous datatype of polymorphic lists; Bintree encompasses external binary search 
trees. The types Perfect and Sequ are examples for nested datatypes: Perfect com- 
prises perfect binary leaf trees (Dielissen & Kaldewaij, 1995) and Sequ implements 
binary random-access lists (Okasaki, 1998). Both definitions make use of the aux- 
iliary datatype Fork whose elements may be interpreted as internal nodes. 

Haskell's data construct combines several features in a single coherent form: 
sums, products, and recursion. Using more conventional notation ('+' for sums and 
' x ' for products) and omitting constructor names we obtain the following emaciated 
recursion equations. 



2 A polytypic programming primer 



2.1 Datatypes 



data List a 
data Bintree a\ a-2 
data Fork a 
data Perfect a 
data Sequ a 



Nil | Cons a (List a) 

Leaf a\ \ Node (Bintree a\ 02) a? (Bintree a\ a?) 
Fork a a 

Null a I Succ (Perfect (Fork a)) 

Empty I Zero (Sequ (Fork a)) \ One a (Sequ (Fork a)) 



List a 

Bintree cii 02 
Fork a 
Perfect a 
Sequ a 



1 + a x List a 

ai + Bintree a\ 02 x x Bintree a\ 02 



a + Perfect (Fork a) 

1 + Sequ (Fork a) + a x Sequ (Fork a) 



a x a 
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+ 

A 

a + 

/ \ 
x + 

A / \. 

a a x 

/ \ 
x x 

/ \ / \ 
a a a a 



Perfect a 



Fig. 1. Types interpreted as infinite type expressions. 



In the following we treat 1, '+', and 'x' as if they were given by the following 
datatype declarations. 

data 1 =() 

data «i + a-2 = Inl ai | Inr a^, 
data «i x « 2 = (ai, 02) 

Now, the central idea of polytypic programming is that the set of all types — 
or rather, the set of all type expressions itself can be modelled by a datatype. 
Assuming a fixed set of type variables A = {01,02,03, . . .} and a set of primitive 
type constructors P = {1, Int, ...,+, x } type expressions can be seen as denned by 
the following grammar. 

T = A\P(T,...,T) 
The type F(t\, . . . ,t n ) denotes the application of an n-ary type constructor to n 
types. We omit the parenthesis when n ^ 1. We also write t\ + t 2 for +(£1,^2) 
and similarly t\ x t 2 - Finally, we abbreviate a\ to a when defining unary type 
constructors. 

The question remains how recursive types are modelled. The answer probably 
comes as no surprise to the experienced Haskell programmer: recursive types are 
modelled by infinite type expressions! Figure 1 displays the infinite type expressions 
defined by the equations for List and Perfect. 



2.2 Polytypic definitions 

A polytypic function is defined by induction on the structure of types. In general, 
the definition takes the following form. 

poly(t) :: r(t) 
poly{at) = poly a . 
poly{F(ti,...,t n )) = poly F (poly (h),..., poly (*„)) 

The type parameter is written in angle brackets to distinguish it from ordinary 
parameters. If t is an n-ary type constructor, poly a . must be specified for 1 ^ i ^ n. 
Furthermore, an equation must be given for each primitive type constructor F G P. 
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As an example, the function flatten, which listifies a given structure, can be 
defined as follows. 



The first two equations specify the action of flatten on miliary type constructors, ie 
flatten t = Xa — > [] for each t € {1, Int, . . .}. The third equation defines flatten a = 
A a — > [a]. Finally, flatten + and flatten x are given by 

flatten + ((fi , ip^) = Aa^ case a of {Inl a\ — > ipi a\\Inr 02 — > <^2 ^2} 
flatten x ((fi,(f2) = A(«i , a 2 ) -> «i -H- ^2 «2 • 

This information is sufficient to define a unique function flatten(t) for each (unary) 
type expression t (Courcelle, 1983). Of course, since t may be infinite — and usually 
is — we require that types are interpreted by complete partial orders and functions 
by continuous functions between them. Both conditions are usually met. 

The use of infinite type expressions as index sets for polytypic functions dis- 
tinguishes our approach from previous ones (Jeuring & Jansson, 1996; Jansson 
& Jeuring, 1997), which are based on the initial algebra semantics of datatypes. 
Briefly, our approach has two major advantages: it is simpler (the programmer 
must consider less cases) and it is more general (it covers all first-order polymor- 
phic datatypes). We refer the interested reader to (Hinze, 1999) for a more detailed 
account of the pros and cons. 



The main purpose of a polytypic programming system is to specialize a polytypic 
function poly(t) for different instances of t. Unfortunately, the specialization cannot 
be based on the inductive definition of poly(t) — at least, not directly. Consider the 
following attempt to specialize poly (Perfect a): 

poly (Perfect a) 
= poly (a + Perfect (Fork a)) 
= poly + (poly a , poly (Perfect (Fork a))) 
= poly + (poly a , poly (Fork a + Perfect (Fork (Fork a)))) 
= poly + (poly a , poly + (poly (Fork a), poly (Perfect (Fork (Fork a))))) 

To define poly (Perfect a) we require poly(Perfect (Fork n a)) for each n ^ 1. It is 
probably clear that in general we cannot hope to obtain a finite representation of 



flatten(t) 
flatten(l) a 
flatten(Int) a 
flatten(a) a 
flatten(t\ + t^) (Inl a\) 
flatten(t\ + t^) (Inr 02) 
flatten(t\ x i 2 ) («2, a 2 ) 




2.3 Specializing polytypic definitions 
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poly(t) this way. Instead, we must base the specialization on the representation of 
types, ie on the datatype declarations themselves, which are by necessity finite. 

To exhibit the structure of datatype declarations more clearly we shall rewrite 
them as functor equations. Functor expression of arity n are given by the following 
grammar. 

pn = n 1 ? \ P n \ F k ■ (F?,...,F£) 

By II™ we denote the n-ary projection functor selecting its i-th component. For 
n = 1 and n = 2 we use the following more familiar names: Id = II J , Fst = Ilf , and 
Snd = Elements of P n are predefined functors of arity n, ie P° = {1, Int, . . .} 
and P' 2 = {+, x}. The expression F ■ (Fi, . . . , F^) denotes the composition of an 
A;-ary functor F with functors F t , all of arity n. We omit the parenthesis when n = 1 
and we write Kt instead of t ■ () when n = 0. Finally, we write fa + fa for + • (fa , fa) 
and similarly fa x fa. 

Here are the datatype definitions of Section 2.1 rewritten as functor equations. 

List = Kl + Id x List 

Bintree = Fst + Bintree x Snd x Bintree 

Fork = Id x Id 

Perfect = Id + Perfect ■ Fork 

Sequ = K 1 + Sequ ■ Fork + Id x Sequ ■ Fork 

In essence, functor equations are written in a compositional or 'point-free' style 
while data definitions are written in an applicative or 'pointwise' style. 

Now, the central idea is to define, for each arity n, an n-ary function poly n (f) 
satisfying 

poly n (f) (poly (h),..., poly (t n )) = poly(f(t 1 ,...,t n )) . 

We let function follow type: / is an n-ary functor mapping the types ti,...,t n 
to f(h, . . . , t n ); likewise poly n (f) is an n-ary function mapping the polytypic func- 
tions poly(ti), . . . , poly(t n ) to poly(f(ti, . . . , t n )). It can be shown that the following 
definition satisfies the condition above: 

poly n (f) :: r(ii) x ■ ■ ■ x r(t n ) -> r(/ . . . , *„)) 

P0ly n (^i) = 7Tj 

poly n (F) = poly F 
Poly n (f ■ (gi,---,9k)) = Poly k (f)* (poly n (g!),..., poly n (g k )) . 

where 7Tj (tpi, . . . , <p n ) = tfi is the i-th projection function and '★' denotes n-ary 
composition defined by tp ★ (tpi, . . . , <p n ) = Xa — > tp (p\ a, . . . ,p n a). Note that 
<p* (pi) = p o p>x when n = 1. Furthermore note, that the definition of poly n (f) is 
inductive on the structure of functor expressions. On a more abstract level we can 
view poly n as an interpretation of functor expressions: IT is interpreted by 7r,, F 
by poly F , and '•' by 
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Now, setting ti = a, we can define poly in terms of poly n . 

poly(f) r(f) 
poly(f(a 1 ,...,a n )) = poly „(/) (poly ai , poly a J 

By now we have the necessary prerequisites at hand to define the specialization 
of a polytypic function poly(f(a\, . . . , a n )) for a given instance of /. Assume that 
the type constructor is defined by the system of equations (f\ = e\, . . . ,f m = e m ) 
with f = fi for some i. For each equation /, = e,, where /, is an fc-ary type 
constructor, a function definition of the form poly k (fi) = poly k (ei) is generated. 
The expression poly k (ei) is given by the inductive definition above, additionally 
setting poly k (fi) = poly-k-fi, where polyJz-fi is a new function symbol. Finally, 
the defining equation for poly-f, ie poly-f = polyjn-f (poly ai , . . . ,poly Un ), must be 
added. 

Let us apply the above framework to specialize flatten(t) for t = Perfect. Since 
flatten(t) has a polymorphic type, the auxiliary functions flatten n (f) take poly- 
morphic functions to polymorphic functions. We have, for instance, 

flatten 1 {f) :: (Voi a -> [o]) -> (Vo./ (i a) -> [o]) . 

In other words, flatten 1 (f) has a rank-2 type signature (McCracken, 1984). 

The specialization proceeds entirely mechanically. Using the original constructor 
names and abbreviating type names to their first letter we obtain 

flattenP :: Perfect a — > [a] 

flattenP = flattenlP (Xa ->■ [a]) 

flattenlF :: (Va.i a -> [a]) -> (Va.Fork (t a) -> [a]) 

flattenlF f (Fork a± a 2 ) = / «i -H-/ a- 2 

flattenlP :: (Va.t a ->• [a]) ->■ (\fa. Perfect (t a) -> [a]) 

flattenlP f (Null a) = f a 

flattenlP f (Succ a) = flattenlP (flattenlF f ) a . 

Flattening a perfect tree operates in two stages: while recursing flattenlP constructs 
a tailor-made flattening function flattenlF 1 f of type V 'a. Fork 1 t a — > [a] which is 
eventually applied in the base case. 

Remark. Unfortunately, the above definitions pass neither the Hugs nor the GHC 
type-checker though both accept rank-2 type signatures. The reason is that Haskell 
provides only a limited form of type constructor polymorphism. Consider the subex- 
pression flattenlF f in the last equation. It has type V 'a. Fork (t a) — > [a] which 
is not unifiable with the expected type Va.i' a — > [a]. Since Haskell deliberately 
omits type abstractions from the language of type constructors (Jones, 1995), we 
cannot instantiate t' to Am — > Fork (t u). Fortunately, there is a way out of this 
dilemma. If we assign the following types to flattenlF and flattenlP 

flattenlF :: (v ->■ [w]) -> (Fork v ->■ [w]) 
flattenlP :: (v ->■ [w]) -> (Perfect v ->■ [w]) , 

the above definitions type-check. This trick works as long as the definition of poly(t) 
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does not involve polymorphic recursion (in Section 3.2 we will get to know a poly- 
typic function which is polymorphically recursive). □ 



3 Tries generically 

In this section we apply the framework of polytypic programming to implement gen- 
eralized tries generically for all first-order polymorphic datatypes. We have already 
mentioned the basic idea that generalized tries can be considered as a type-indexed 
datatype. To put this idea in concrete terms we will define a datatype 

Map{*) :: * — > * , 

which assigns a type constructor of kind * — > * to each type constructor of kind *. 
The type Map(k) v represents the set of finite maps from k to v. Based on this 
representation we will implement the following operations. 



empty (k) 
single (k) 
lookup (k) 
insert(k) 
merge (k) 



\/v.Map(k) v 

Mv.k xi)-) Map(k) v 

Mv.k — > Map(k) v — > Maybe v 

\/v.(v 4 « 4 «) 4 I; x « 4 (Map(k) v -> Map(k) v) 
Vv.(v (Map(k) v — > Map(k) v — > Map(k) v) 



The signature of lookup(k) deviates slightly from the one used in the introduction 
to this paper: the look-up function returns a value of type Maybe v instead of v to 
be able to signal that a key is unbound. The functions insert(k) and merge(k) take 
as a first argument a so-called combining function, which is applied whenever two 
bindings have the same key. Typically, the combining form is fst or snd. For finite 
maps of type Map(k) Int addition may also be a sensible choice. Interestingly, we 
will see that the combining function is not only a convenient feature for the user, 
it is also necessary for defining insert(k) and merge(k) generically for all types! 



3.1 Type-indexed tries 

Mathematically speaking, generalized tries are based on the following isomorphisms, 
also known as laws of exponentials. 

1 - ^fin v — Maybe v 

(h + k 2 ) ->fin V = (h ->- fln V) X (k 2 ->-fin V) 
(k x X k 2 ) ->-fin v = h ->- fin (k 2 ->-fin v) 

As Map(k) v represents the set of finite maps from k to v, ie k — >-fi n v, the isomor- 
phisms above can be rewritten as defining equations for Map(k). 



Map(l) = Maybe 

Map {Int) = Patricia. Diet 

Map(k 1 + k 2 ) = Map(kx) x Map(k 2 ) 

Map(k 1 x k 2 ) = Map(kx) ■ Map(k 2 ) 
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We assume the existence of a suitable library implementing finite maps with integer 
keys. Such a library could be based, for instance, on a data structure known as a 
Patricia tree (Okasaki & Gill, 1998). This data structure fits particularly well in 
the current setting since Patricia trees are a variety of tries. For clarity we will 
use qualified names when referring to entities defined in the hypothetical module 
Patricia. 

Building upon the techniques developed in Section 2.3 we can now specialize 
Map{k) for a given instance of k. That is, for each functor / of arity n we will 
define an n-ary higher-order functor Map n (f). For n = 1 we have, for instance, 

Map 1 (* — > *) :: (*—»*)—»(*—»*) . 

The type constructor Map^f) is the generalized trie of a polymorphic datatype. 
It takes as argument the generalized trie of the base type, say, a and yields the 
generalized trie of / a. It may come as a surprise that the framework for specializing 
type-indexed functions is also applicable to type-indexed datatypes. The reason 
is quite simple: the definition of poly n {f) requires only two operations, namely 
projection and composition. However, both operations are also available in the 
world of functors and higher-order functors. 

Let us specialize Map n (f) to the datatypes listed in Section 2.1. For better read- 
ability we abbreviate type names to their first letter and omit the arity of functors, 
ie we write MapL instead of MaplList. 

MapL m = Maybe x m ■ MapL m 

MapB (mi, 1712) = mi x MapB (mi, mi) ■ mi ■ MapB (mi, mi) 

MapF m = m ■ m 

MapP m = m x MapP (MapF m) 

MapS m = Maybe x MapS (MapF m) x m • MapS (MapF m) 

Since Haskell permits the definition of higher-order polymorphic datatypes, the 
higher-order functors above can be directly coded as datatypes. All we have to do 
is to bring the equations into an applicative form. 

data MapL m v = TrieL (Maybe v) (m (MapL m v)) 

data MapB mi mi v = TrieB (mi v) 

(MapB mi mi (mi (MapB mi mi v))) 

These types are the polymorphic variants of MapStr and MapBin defined in the 
introduction, ie we have MapStr = MapL MapChar (since Str = List Char) and 
MapBin = MapB MapStr MapChar (since Bin = Bintree Str Char). Things 
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become interesting if we consider nested datatypes. 

data MapF m v = TrieF (m (m v)) 

data MapP m v = TrieP (m v) 

(MapP (MapF m) v) 

data MapS m v = TrieS (Maybe v) 

(MapS (MapF m) v) 
(m (MapS (MapF m) v)) 

The generalized trie of a nested datatype is a higher-order polymorphic nested 
datatype! The nest is higher-order polymorphic since the type parameter which 
is instantiated in a recursive call ranges over type constructors of kind * — > *. 
By contrast, MapB is a first-order polymorphic nest since its instantiated type 
parameter has kind *. It is quite easy to produce generalized tries which are 
both first- and higher-order nests. If we change the type of Sequ's third con- 
structor to One (Sequ (Fork a)) a, then the third component of TrieS has type 
MapS (MapF m) (m v) and MapS is consequently both a first- and a higher-order 
nest. 



3.2 Empty and singleton tries 

The empty trie is given by 

empty (k) :: Vv.Map(k) v 

empty(l) = Nothing 

empty (Int) = Patricia, empty 

empty (ki + k- 2 ) = (empty (ki), empty (k 2 )) 

empty (ki x k 2 ) = empty (ki) . 

The definition already illustrates several interesting aspects of programming with 
generalized tries. To begin with the polymorphic type of empty(k) is necessary 
to make the definition work. Consider the last equation: empty (k\ x k 2 ), which 
is of type Vv.Map(ki) (Map(k 2 ) v), is defined in terms of empty(ki), which is of 
type Vv.Map(ki) v. That means that empty(k\) is used polymorphically. In other 
words empty(k) makes use of polymorphic recursion! By contrast, the definition 
of flatten(t) given in Section 2.2 also type-checks when the type is restricted to 
t s — > [s] for some fixed type s. 

Since empty (k) has a polymorphic type, empty n (f) takes polymorphic values to 
polymorphic values. We have, for instance, 

empty 1 (f) :: (Vv.Map(k) v) ->■ (\fv.Map(f k) v) 

To obtain a signature which is expressible in Haskell we employ the specification of 
Map n (f), ie Map(f (h, . . . , k n )) = Map n (f) (Map(h), . . . , Map(k n )), additionally 
setting Map(k) = m where m is a fresh type variable. 



empty ^f) 



(Vv.m v) — > (\/v .Map^f) m v) 
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Let us take a look at some examples. 1 

emptyL :: (Wm v) — > MapL m w 
emptyL e = TrieL Nothing e 

emptyF :: (Wm v) — > MapF m w 
emptyF e = TrieF e 

emptyP :: (Wm v) — > MapP m w 
emptyP e = TrieP e (emptyP (emptyF e)) 

The second function, emptyF, illustrates the polymorphic use of the parameter: 
e has type Wm v but is used as an element of m (m w). The last defini- 
tion employs 'higher-order polymorphic' recursion: the recursive call is of type 
(\fv.MapF m v) — > MapP (MapF m) w which is a substitution instance of the 
declared type. The function emptyP illustrates another point: the implementation 
of generalized tries relies in an essential way on lazy evaluation. As an example, 
consider the empty trie for Perfect Int, which is represented by the infinite tree 
(abbreviating Patricia. empty to e) 

TrieP e (TrieP (TrieF e) (TrieP (TrieF (TrieF e)) . . .)) . 

In Section 4.1 we shall discuss a slightly modified representation of generalized tries 
which avoids this problem. 

The singleton trie which contains only a single binding is defined as follows. 

single(k) :: Vv.k x v — > Map(k) v 

single (I) (Q,v) = Just v 

single(Int) (i, v) = Patricia. single (i, v) 

single(k\ + fe) (Inl i\,v) = (single(k\) (i\, v), empty^)) 

single{k\ + fe) (Inr i2, v) = (empty(k\), single^) («2, v)) 

single(k\ x fe) ( ( «i , «2 ) , « ) = single(k\) (i\, single (fe) («2,^)) 

The definition of single(k) is interesting because it falls back on empty (k) in the 
third and the fourth equation. This necessitates that single n (f) is parameterised 
both with single(k) and with empty (k). For n = 1 we obtain the type signature 

single^f) :: (Wm v) 

— > (Vv.k x v — > m v) 

-> (W/ k x v ^ Map 1 (f) m v) . 

Let us again specialize the polytypic function to lists and perfect trees. To improve 
readability we will henceforth present the instances without their type signatures — 
which are, nonetheless, mandatory. 

singleL e s (Nil, v) = TrieL (Just v) e 

singleL e s (Cons i is,v) = TrieL Nothing (s (i, singleL e s (is,v))) 
singleF e s (Fork i\ i2,v) = TrieF (s (i\,s («2,«))) 

1 Note that we use Hugs/GHC syntax for universal quantification (Peyton Jones, 1998), which 
forces us to write (Wm v) — ► MapL in, w instead of (Wm v) — > (iv.MapL m v). 
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singleP e s (Null i, v) = TrieP (s (i, v)) (emptyP (emptyF e)) 

singleP e s (Succ i,v) = TrieP e (singleP (emptyF e) (singleF e s) (i,v)) 

The function singleF illustrates that the 'mechanically' generated definitions can 
sometimes be slightly improved. Since the definition of Fork does not involve sums, 
singleF does not require its first argument which could be safely removed. 



3.3 Lookup 

The look-up function implements the scheme discussed in the introduction. 



lookup (k) 
lookup (1) () t 
lookup (Int) i t 

lookup(ki + k 2 ) (Inl i\) (ti, t 2 ) 
lookup(k 1 + k 2 ) (Inr i 2 ) (h,t 2 ) 
lookup(ki x k 2 ) (h,i 2 ) h 



:: Vv.k — > Map(k) v — > Maybe v 

= t 

= Patricia. lookup i t 

= lookup (ki) ii t\ 

= lookup (k 2 ) «2 t 2 

= do {t 2 <— lookup (ki) ii t\ \ lookup (k 2 ) i 2 t 2 } 



On sums the look-up function selects the appropriate map; on products it 'com- 
poses' the look-up functions for the components. Since lookup(k) has result type 
Maybe v, composition amounts to monad or Kleisli composition (Bird, 1998). Defin- 
ing 

(mi O m 2 ) ai = do { a 2 <— mi ai; m 2 a 2 } 
we may write the last equation more succinctly as 

lookup(ki x k 2 ) (h,i 2 ) = lookup(ki) ii O lookup(k 2 ) i^ . 

Specializing lookup (k) to concrete instances of k is by now probably a matter of 
routine. Here is lookup 1 (f) , s type signature. 

lookup 1 (f) :: (Vv.k — > m v — > Maybe v) 

->■ (V?;./ k ->■ Map 1 {f) m v -> Maybe v) . 

For lists and perfect trees we obtain 

lookupL I Nil ( TrieL tn tc) = tn 

lookupL I (Cons i is) (TrieL tn tc) = (lookupL I is O I i) tc 

lookupF I (Fork h i 2 ) (TrieF tf) = (£ i 2 O £ h) tf 

lookupP I (Null i) ( TrieP ts tc) = I i ts 

lookupP I (Succ i) (TrieP ts tc) = lookupP (lookupF I) i tc . 

Note that lookupL generalizes lookupStr defined in the introduction to this paper; 
we have lookupStr s = from Just o lookupL lookupChar s where from Just is given 
by from Just (Just a) = a. The definition of lookupP employs the same recursion 
scheme as flattenlP: while recursing, lookupP constructs a tailor-made look-up 
function lookupF 1 I of type Fork 1 k — > MapF 1 w — > Maybe w which is finally 
applied in the base case. 
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3.4 Inserting and merging 

Insertion is defined in terms of merge(k) and single (k). 

insert(k) :: Mv.(v —>•?;—>•?;)— J-fcx?;—)- (Map(k) v — > Map(k) v) 

insert(k) c (i,v) t = merge(k) c (single(k) (i,v)) t 

Note that this is not the most efficient implementation of insert(k) since singleton 
tries are in general given by infinite trees. This implies that the running time of 
insert(k) is not proportional to the size of the inserted element as one would ex- 
pect. The problem vanishes, however, if we employ the alternative representation 
of generalized tries to be introduced in Section 4.1. 

Merging two tries is surprisingly simple. Given an auxiliary function for combin- 
ing two values of type Maybe a 

combine :: (a — > a — > a) 

— > (Maybe a — > Maybe a — > Maybe a) 

combine c Nothing Nothing = Nothing 

combine c Nothing ( Just a') = Just a' 

combine c (Just a) Nothing = Just a 

combine c (Just a) (Just a') = Just (c a a') , 

we can define merge(k) as follows. 

merge(k) :: \/v.(v —>«—>«) 

-» (Map(k) v -» Map{k) v -» Map{k) v) 

merge(l) c t t' = combine c t t' 

merge (Int) ctt' = Patricia. merge c t t' 

merge(ki + fe) c (t\, fe) (t[, t' 2 ) = (merge{k\) c t\ t[, merge^k-i) c ti t' 2 ) 

merge{k\ x fe) c t t' = merge(ki) (merge^) c) t t' 

The most interesting equation is the last one. The tries t and t' are of type 
Map(ki x fc 2 ) v = Map(ki) (Map(kf2) v). To merge them we can use merge(ki); 
we must, however, supply a combining function of type Map(kf2) v — > Map(kf2) v — > 
Map{k2) v. A moment's reflection reveals that that merged) c is the desired com- 
bining function. Using functional composition we can write the last equation quite 
succinctly as 

merge(ki x fc 2 ) = merge(ki) o merged) . 

The definition of merge(k) shows that it is sometimes necessary to implement op- 
erations more general than actually needed. If merge(k) had the simplified type 
Map(k) v — > Map(k) v — > Map(k) v, then we would not be able to give a defining 
equation for k = k\ x ki . 

To complete the picture let us again specialize the merging operation for lists 
and perfect trees. To begin with here is merge 1 (f) , s type signature. 



merge x lj) :: (\/v.(v — > v — > v) — > (m v — > m v — > m v)) 

->■ (\fv.(v — ^ v — ^ — ^ (Map 1 (f) m v -> Map 1 ({) m v -> Map 1 (f) m 
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The different instances of merge 1 (f) are surprisingly concise. 

mergeL m c (TrieL tn\ tc\) (TrieL tn 2 tc 2 ) 

= TrieL (combine c tn\ tn 2 ) (m (mergeL m c) tc\ tc 2 ) 

mergeF m c (TrieF tf±) (TrieF tf 2 ) 
= TrieF (m (m c) tf 1 tf 2 ) 

mergeP m c (TrieP ts\ tc\) (TrieP ts 2 tc 2 ) 

= TrieP (m c tsi ts 2 ) (mergeP (mergeF m) c tc\ tc 2 ) 



4 Variations of the theme 
4-1 Spotted tries 

The representation of tries as defined in the previous section has two major draw- 
backs: (i) it relies in an essential way on lazy evaluation and (ii) it is inefficient. 
Both disadvantages have their roots in the representation of tries on sums. A trie 
on ki + k 2 is a pair of tries irrespective of whether the trie is empty or not. This 
suggests to devise a special representation for the empty trie. Technically, this is 
achieved using so-called spot products (Connelly & Lockwood Morris, 1995). 

data «i x. a 2 = Spot \ Pair a\ a 2 
Spot products are also known as optional pairs. Changing Map(k)'s definition to 

Map{ki+k 2 ) = Map(h) x. Map{k 2 ) 
we can now represent the empty trie in constant space. 

empty (ki + k 2 ) = Spot 

This representation is, of course, no longer unique. Therefore, we require that the 
empty trie on sums is always represented by Spot. Maintaining this invariant in our 
implementation is, however, trivial since tries never shrink. The situation would be 
different if we additionally supplied an operation for removing bindings from a trie. 
The remaining operations must be modified accordingly. 

single(ki + k 2 ) (Inl i\,v) = Pair (single(ki) (ii,v)) (empty(k 2 )) 

single(ki + k 2 ) (Inr i2,v) = Pair (empty(ki)) (single(k 2 ) (i 2 ,v)) 

lookup(ki + k 2 ) k Spot = Nothing 

lookup(ki + k 2 ) (Inl i\) (Pair t\ t 2 ) = lookup(k\) i\ t\ 
lookup(k\ + k 2 ) (Inr i 2 ) (Pair t\ t 2 ) = lookup(k 2 ) i 2 t 2 

merge(k\ + k 2 ) c Spot t' = t' 

merge(k\ + k 2 ) c t Spot = t 

merge(k\ + k 2 ) c (Pair t\ t 2 ) (Pair t[ t 2 ) 

= Pair (merge(ki) c t\ t[) (merge(k 2 ) c t 2 t 2 ) 

Figure 2 contains the complete code for generalized tries on binary random-access 
lists building on the above representation. Some remarks are appropriate. First of 
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{- Generalized tries for polymorphic binary random-access lists -} 

data MapS m v = SpotS | TrieS (Maybe v) 

(MapS (MapF m) v) 
(m (MapS (MapF m) v)) 

MapS m w 
SpotS 

(Vu.m v) 

(yv.k x v — > m v) 
Sequ k x w —¥ MapS m w 
TrieS (Just v) emptyS e 
TrieS Nothing 

(singleS (emptyF e) (singleF e s) (x, v)) 
e 

TrieS Nothing 
emptyS 

(s (i, singleS (emptyF e) (singleF e s) (x,v))) 

(\fv.k — > rn v — > Maybe v) 
— > Sequ k —¥ MapS m w —¥ Maybe w 
lookupS I Empty (TrieS te tz to) 
= te 

lookupS I (Zero x) (TrieS te tz to) 

= lookupS (lookupF £) x tz 
lookupS I (One i x) (TrieS te tz to) 

= (lookupS (lookupF £) x O £ i) to 

mergeS :: (Wv.(v — > v — > v) — > (rn v —¥ m v — > m v)) 

— > (w — > w — > w) —¥ (MapS m w —¥ MapS m w —¥ MapS m w) 
mergeS m c SpotS t' = t' 
mergeS met SpotS = t 
mergeS m c (TrieS te\ tz\ to\) (TrieS tei tz2 £02) 

= TrieS (combine c te\ te-i) 

(mergeS (mergeF m) c tz\ tz-i) 
(m (mergeS (mergeF m) c) to\ to?) 
{- Generalized tries for binary random-access lists over integers -} 



emptyS 
emptyS 

singleS 



singleS e s (Empty, v) 
singleS e s (Zero x, v) 



singleS e s (One i x, v) 
lookupS 



type MapSI 

emptySI 
emptySI 

singleSI 
singleSI 

lookupSI 
lookupSI 

insertSI 
insertSI c b t 

mergeSI 
mergeSI 



MapS Patricia. Diet 

MapSI v 
emptyS 

Sequ Int x v — > MapSI v 

singleS Patricia. empty Patricia. single 

Sequ Int —¥ MapSI w —¥ Maybe w 
lookupS Patricia. lookup 

(v — > v — > v) — > Sequ Int x v — > (MapSI v —¥ MapSI v) 
mergeSI c (singleSI b) t 

(v — > v — > v) — > (MapSI v — > MapSI v —¥ MapSI v) 
mergeS Patricia. merge 



Fig. 2. Generalized tries for binary random-access lists. 
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all, the datatype MapS is based on the functor equation 

MapS m = Maybe x. MapS (MapF m) x. m ■ MapS (MapF m) . 

For simplicity we interpret a\ x . a 2 x . a% as the type of optional triples and not as 
nested optional pairs. 

data a\ x. a-i x. az = Spot \ Triple a\ a 2 az 

The definition of emptyS has been simplified by omitting its parameter, which is 
not required. Finally, note that we have not listed the implementation of generalized 
tries for the datatype Fork. Since Fork's definition does not involve sums, the code 
is identical to that given in Section 3. 



4-2 Skinny tries 



Extending the idea of the previous section one step further we could additionally 
devise a special representation for singleton tries. 

data fli .x. a 2 = None \ Onlyl ai \ Onlyr a 2 \ Both a\ a 2 
Using, x. instead of x. has the advantage that single(k) need not refer to empty (k). 



single(ki 
single(ki 



k 2 ) (Inl i\, v) 
k 2 ) (Inr i 2 , v) 



Onlyl (single(ki) (ii,v)) 
Onlyr (single(k 2 ) ( , h,v)) 



This representation is furthermore a bit more space economical. A potential disad- 
vantage is the increased number of cases one must consider when defining lookup (k) 
and merge{k). 



lookup (ki + k 2 ) (Inl i\) None 
lookup (ki + k 2 ) (Inl i\) (Onlyl t\) 
lookup (ki + k-2) (Inl i\) (Onlyr £2) 
lookup(ki + k-2) (Inl i\) (Both t\ fc) 

merge(ki + fe) c (Onlyl h) None 
merge(k\ + fe) c (Onlyl h) (Onlyl t[) 
merge(k\ + fe) c (Onlyl h) (Onlyr t' 2 ) 
merge(k\ + fe) c (Onlyl h) (Both t[ t^) 

The remaining cases are defined accordingly. 



Nothing 
lookup (k{) i\ t\ 
Nothing 
lookup (ki) i\ t\ 

Onlyl t\ 

Onlyl (merge(k\) c t\ t[) 
Both h t' 2 

Both (merge(k\) c t\ t[) t' 2 



5 Related and future work 

D.E. Knuth (1998) attributes the idea of a trie to A. Thue who introduced it 
in a paper about strings that do not contain adjacent repeated substrings. R. de 
la Briandais recommended tries for computer searching (1959). The generaliza- 
tion of tries from strings to elements of an arbitrary datatype was discovered 
by CP. Wadsworth (1979) and others independently since. R.H. Connelly and 
F.L. Morris (1995) formalized the concept of a trie in a categorical setting: They 
showed that a trie is a functor and that the corresponding look-up function is 
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a natural transformation. Interestingly, despite the framework of category theory 
they base the development on many-sorted signatures which makes the definitions 
somewhat unwieldy. This paper shows that the construction of generalized tries is 
much simpler if we replace to concept of a many-sorted signature by its categorical 
counterpart, the concept of a functor. 

The first implementation of generalized tries was given by C. Okasaki in his recent 
textbook on functional data structures (1998). Tries for polymorphic types like lists 
or binary trees are represented as Standard ML functors. While this approach works 
for regular datatypes it fails for nested datatypes such as Perfect or Sequ. In the 
latter case higher-order polymorphic datatypes are indispensable. 

That said, a direction for future work suggests itself, namely to generalize tries 
to arbitrary higher-order polymorphic datatypes. To give an impression of the ex- 
tensions consider the standard definition of rose trees. 

data Rose k = Branch k (List (Rose k)) 

Its trie is given by 

data MapR mk v = TrieR (mk (MapL (MapR mk) v)) . 

Now, abstracting the list functor away we obtain the following generalization of 
rose trees. 

data GRose t k = GBranch k (t (GRose t k)) 
The trie of Rose can be generalized in a similar way. 

data MapGR mt mk v = TrieGR (mk (mt (MapGR mt mk) v)) 

Note that GRose is a type constructor of kind (*—»*)—»(*—» *) while its 
trie has kind ((* —»*)—»(*—» *)) — > ((* —>*)—>(* — > *)). Now, the same 
systematics can be applied to generalize the operations on MapR to operations on 
MapGR. Currently, the author is working on a suitable extension of the framework 
which allows to define polytypic functions generically for all datatypes expressible 
in Haskell. 
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